perf: Batch metadata query to avoid N+1 across 5 call sites #232

Merged

bmerkle merged 10 commits into microsoft:main on Apr 22, 2026
Conversation
Force-pushed from 0fba38f to f9a489b
Add add_terms_batch / add_properties_batch to the index interfaces with executemany-based SQLite implementations. Restructure add_metadata_to_index_from_list and add_to_property_index to collect all items first, then batch-insert via extend() and the new batch methods. Eliminates ~1000 individual INSERT round-trips during indexing.
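The per-item versus batched insert pattern can be sketched as follows. This is an illustrative minimal `TermIndex`, not the PR's actual class; the table schema and method names are assumptions that mirror `add_terms_batch`.

```python
import sqlite3
from typing import Sequence


class TermIndex:
    """Minimal sketch of a term index backed by SQLite (illustrative schema)."""

    def __init__(self) -> None:
        self.db = sqlite3.connect(":memory:")
        self.db.execute("CREATE TABLE terms (term TEXT, semref_id INTEGER)")

    def add_term(self, term: str, semref_id: int) -> None:
        # N+1 style: one INSERT round-trip per item.
        self.db.execute("INSERT INTO terms VALUES (?, ?)", (term, semref_id))

    def add_terms_batch(self, items: Sequence[tuple[str, int]]) -> None:
        # One executemany call replaces N individual round-trips.
        self.db.executemany("INSERT INTO terms VALUES (?, ?)", items)


idx = TermIndex()
idx.add_terms_batch([("alpha", 1), ("beta", 2), ("gamma", 3)])
count = idx.db.execute("SELECT COUNT(*) FROM terms").fetchone()[0]
print(count)  # 3
```

Collecting items first and inserting once lets SQLite amortize statement preparation and transaction overhead across the whole batch.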
Rename _collect_{facet,entity,action}_{terms,properties} to drop the
leading underscore in propindex.py and semrefindex.py.
Change list to Sequence in add_terms_batch and add_properties_batch interfaces and implementations to satisfy covariance. Add missing add_terms_batch to FakeTermIndex in conftest.py.
lookup_term_filtered called get_item() per scored ref — one SELECT and full deserialization per match. The filter only needs knowledge_type (a plain column) and range (json.loads of range_json), never the expensive knowledge_json deserialization (64% of per-row cost).

Add get_metadata_multiple to ISemanticRefCollection that fetches only semref_id, range_json, knowledge_type in a single batch query. Replace the N+1 loop in lookup_term_filtered with one get_metadata_multiple call.

Benchmark (200 matches, 200 rounds): 4.38ms → 1.32ms (3.3x speedup).
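The metadata-only batch fetch might look roughly like this. A hypothetical sketch: the `semrefs` schema and the standalone `get_metadata_multiple` function are illustrative stand-ins for the collection method, showing one `IN`-clause SELECT that never touches the `knowledge_json` column.

```python
import json
import sqlite3


def get_metadata_multiple(db: sqlite3.Connection, semref_ids: list[int]):
    """Fetch only the cheap metadata columns in one batch query (sketch)."""
    placeholders = ",".join("?" * len(semref_ids))
    rows = db.execute(
        f"SELECT semref_id, range_json, knowledge_type "
        f"FROM semrefs WHERE semref_id IN ({placeholders})",
        semref_ids,
    )
    # Only range_json needs json.loads; the expensive knowledge_json
    # column is never read or deserialized.
    return sorted(
        ((sid, json.loads(rj), ktype) for sid, rj, ktype in rows),
        key=lambda r: r[0],
    )


db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE semrefs "
    "(semref_id INTEGER, range_json TEXT, knowledge_type TEXT, knowledge_json TEXT)"
)
db.executemany(
    "INSERT INTO semrefs VALUES (?, ?, ?, ?)",
    [
        (1, '{"start": 0}', "entity", "<large payload, never parsed>"),
        (2, '{"start": 5}', "action", "<large payload, never parsed>"),
    ],
)
meta = get_metadata_multiple(db, [1, 2])
print(meta[0])  # (1, {'start': 0}, 'entity')
```

Skipping the large JSON column in the SELECT avoids both the I/O and the deserialization cost, which is where the bulk of the per-row time went.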
Apply the same get_metadata_multiple pattern from lookup_term_filtered to four more sites that called get_item() in a loop:

- propindex.lookup_property_in_property_index: filter by .range
- SemanticRefAccumulator.group_matches_by_type: group by .knowledge_type
- SemanticRefAccumulator.get_matches_in_scope: filter by .range
- answers.get_scored_semantic_refs_from_ordinals_iter: two-phase metadata filter, then batch get_multiple for matching full objects

All sites now use a single batch query instead of N individual SELECTs, skipping knowledge_json deserialization where only range or knowledge_type is needed.
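The two-phase pattern used in the last site can be sketched with plain dictionaries standing in for the collection. `get_metadata_multiple`, `get_multiple`, and `scored_refs_filtered` here are illustrative names, not the PR's actual signatures.

```python
# Ordinal -> knowledge_type (cheap metadata) and ordinal -> full object.
METADATA = {1: "entity", 2: "topic", 3: "entity"}
FULL = {1: {"name": "Ada"}, 2: {"name": "math"}, 3: {"name": "Alan"}}


def get_metadata_multiple(ordinals):
    # Phase 1 backend: returns only cheap metadata, one batch call.
    return [(o, METADATA[o]) for o in ordinals]


def get_multiple(ordinals):
    # Phase 2 backend: full (expensive) objects, one batch call.
    return [FULL[o] for o in ordinals]


def scored_refs_filtered(ordinals, wanted_type):
    # Phase 1: filter on cheap metadata, no full deserialization.
    keep = [o for o, ktype in get_metadata_multiple(ordinals) if ktype == wanted_type]
    # Phase 2: fetch full objects only for the survivors.
    return get_multiple(keep)


print(scored_refs_filtered([1, 2, 3], "entity"))  # [{'name': 'Ada'}, {'name': 'Alan'}]
```

When the filter rejects most candidates, this pays the expensive deserialization only for the rows that actually survive.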
…arisons

- Use bisect_right with key=start in TextRangeCollection.contains_range to skip the O(n) linear scan (O(log n) for non-overlapping point ranges)
- Replace TextLocation allocations in TextRange __eq__/__lt__/__contains__ with a shared _effective_end returning tuples
- Skip pydantic validation in get_metadata_multiple by constructing TextLocation/TextRange directly from JSON
Force-pushed from f9a489b to 743d8db
bmerkle (author) reviewed Apr 14, 2026

gvanrossum-ms approved these changes Apr 22, 2026

gvanrossum approved these changes Apr 22, 2026
Stack: 4/4 — depends on #230. Merge #231, #229, #230, then this PR.
- `get_item()` per scored ref — one SELECT and full deserialization per match (N+1 pattern)
- Add `get_metadata_multiple` to `ISemanticRefCollection` that fetches only `semref_id`, `range_json`, `knowledge_type` in a single batch query
- Replace the N+1 loop with one `get_metadata_multiple` call at each site
- Bisect-based `contains_range`, inline tuple comparisons in `TextRange`, skip pydantic validation in `get_metadata_multiple`

Call sites optimized
- `lookup_term_filtered` — batch metadata, filter by knowledge_type/range
- `lookup_property_in_property_index` — batch metadata, filter by range scope
- `SemanticRefAccumulator.group_matches_by_type` — batch metadata, group by knowledge_type
- `SemanticRefAccumulator.get_matches_in_scope` — batch metadata, filter by range scope
- `get_scored_semantic_refs_from_ordinals_iter` — two-phase: metadata filter then batch fetch

Additional optimizations
- `TextRangeCollection.contains_range`: replaced the O(n) linear scan with `bisect_right` keyed on `start`, reducing scope filtering from ~25ms to ~9ms
- `TextRange`: replaced `TextLocation` allocations in `__eq__`/`__lt__`/`__contains__` with a shared `_effective_end` returning tuples
- `get_metadata_multiple`: construct `TextLocation`/`TextRange` directly from JSON instead of going through `__pydantic_validator__`

Benchmark
Azure Standard_D2s_v5 — 2 vCPU, 8 GiB RAM, Python 3.13
Query (pytest-async-benchmark pedantic, 200 rounds)
200 matches against a 200-message indexed SQLite transcript. Only the function under test is timed.
Functions benchmarked: `lookup_term_filtered`, `group_matches_by_type`, `get_scored_semantic_refs_from_ordinals_iter`, `lookup_property_in_property_index`, `get_matches_in_scope`

Reproduce the benchmark locally
pip install 'pytest-async-benchmark @ git+https://github.com/KRRT7/pytest-async-benchmark.git@feat/pedantic-mode' pytest-asyncio
python -m pytest tests/benchmarks/test_benchmark_query.py -v -s

Generated by codeflash optimization agent